Cell Decomposition and Compactification 

of 

Riemann's Moduli Space 
in 

Decorated Teichmiiller Theory 

R. C. Penner 

Departments of Mathematics and Physics/ Astronomy 
University of Southern California 
Los Angeles, CA 90089 



May 22, 2003 

Abstract This survey covers earlier work of the author as well 
as recent work on Riemann's moduli space, its canonical cell 
decomposition and compactification, and the related operadic 
structure of arc complexes. 



Introduction 



Riemann's moduli space M = M(F) of a surface F has an essentially canonical cell de- 
composition ([Ha]-[Ko]-[St] or [PI]) and admits various interesting compactifications, some 
of which are less or more compatible with the cell decomposition. There are furthermore 
several related operads based upon supersets of M, that is, there is a natural composition 
of appropriate (possibly degenerate) surfaces. These structures on moduli space are use- 
ful tools in several guises on the interfaces of mathematics and physics. For instance and 
essentially by definition, a conformal field theory leads to an algebra over the operad of un- 
compactified moduli space, where the operad structure is induced by gluing surfaces along 
boundary components, and algebras over this operad give Batalin-Vilkovisky structures 
[Ge]. In practice, one is furthermore interested in compactifications, or partial compacti- 
fications, of M; for instance, the Deligne-Mumford compactification [DM] and its operad 
in an appropriate setting correspond to Gromov-Witten theory and quantum cohomology. 
These remarks emphasize the importance of understanding compactifications of M. 
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A basic fact about moduli space M = M(F) is that it admits a natural cell decomposition; 
there are several incarnations of cell decompositions depending upon the type of surface 
F, and we distinguish between "punctured " (surfaces without boundary and with at least 
one puncture) and "bordered" (surfaces with non-empty boundary which may also have 
punctures). As is well-known in the punctured case, M is a non-compact orbifold, which 
comes equipped with a canonical cell decomposition, where the cells in the decomposition 
are in one-to-one correspondence with suitable classes of graphs, or equivalently, with 
suitable classes of arc families in F; this cell decomposition does not extend to the DM 
compactification in any known way. 

In the setting of this paper in the case of a punctured surface Fi, upon choosing a dis- 
tinguished puncture of F\, there is a canonical compactification Arc{F\) of M(Fi), which 
arises as the quotient of the simplicial completion of the natural cell decomposition. In the 
setting of this paper in the case of a bordered surface F 2: there is a natural action of the 
positive reals R>o on M(F 2 ) which is generated by scaling all of the hyperbolic lengths 
of the boundary geodesies, and we shall let M(F 2 ) /Ttyo denote the quotient. There is a 
canonical compactification Arc(F 2 ) of a space proper homotopy equivalent to M(F 2 ) /R>o, 
where Arc(F 2 ) comes equipped with a canonical cell decomposition. We had previously 
conjectured [P3] that in the case of a bordered surface F 2 , Arc(F 2 ) is PL-homeomorphic to 
a sphere and shall discuss in §5 this "sphericity conjecture" . This generalizes to arbitrary 
bordered surfaces a classical combinatorial fact about triangulations of polygons as we shall 
see. In fact, this sphericity of Arc(F 2 ) for any bordered surface F 2 , in turn, implies that 
the compactification Arc{F\) of moduli space is an orbifold, for any punctured surface F\ 
as we shall also see. In particular, Arc{F\) is conjecturally a new orbifold compactification 
of M{F\) for a punctured surface -Fi, where the cell decomposition of M{F\) extends to 
Arc(Fi). 

Other aspects of the geometry and combinatorics of these new compactifications had al- 
ready been described and will be recalled here as well. For instance: whereas the DM 
compactification is regarded in our formalism as "forgetting" the geometric structure on 
sub-annuli of F, our compactification allows one to "forget" geometric structure on any 
sub-surface embedded in F; a point in our compactification gives a canonical decomposi- 
tion of F into respective geometric and topological sub-surfaces, and the topological part is 
non-empty if and only if the point is ideal; whereas Thurston's compactification (see [PH] 
for instance) of Teichmiiller space records only those hyperbolic lengths which diverge, our 
compactification records only those hyperbolic lengths which do not diverge. 

On the algebraic side, Sullivan [CS], [Su], [CJ], [Vo] has introduced a "string prop", which 
is intended to formalize in the language of traditional algebraic topology certain algebraic 
aspects of physical and mathematical theories involving colliding strings. This prop is 
intended to be "universal" in the sense that it should act on the loop space of any man- 
ifold. The precise definition of the appropriate compactification is apparently still being 
formulated. 

In our setting, there are natural operad structures on arc complexes in the bordered case 
which are closely connected to the string prop. These "arc operads" have been studied 
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in [KLP] (and are recalled here in §7 with a more thorough survey given in Kaufmann's 
paper [K2] in this volume, which also surveys his independent work from [Kl]). Thus, this 
paper both surveys older results [P1-P3] (in §§1,2,6) and sketches (and even excerpts at 
points) the newer works [P4] on bordered surfaces (in §3), on sphericity (in §5), and [KLP] 
on the operad structure (in §7). 

We have attempted to include enough background material to provide a meaningful survey, 
for instance, including many of the principal identities of decorated Teichmiiller theory in 
§2 and sketches of proofs, where appropriate. The intention has been to emphasize the 
background material for the compactification and for the operad structure, and there are 
other aspects [P5] of decorated Teichmiiller theory (notably matrix models, universal con- 
structions, homeomorphisms of the circle and Lie algebras, wavelets and Fourier transform) 
which are not discussed here. 

This paper is organized as follows. §1 gives basic definitions and compares the conformal 
treatment of the cell decomposition of moduli space in the punctured case with the hyper- 
bolic treatment of the cell decomposition. §2 describes the several coordinates systems of 
decorated Teichmiiller theory and develops from first principles several deep results. As 
mentioned before, §§3-4 survey our newer work on bordered surfaces, and §5 gives all the 
details of a proof of the sphericity conjecture for punctured polygons. §6 contains older 
work that explains the degeneration of hyperbolic structure in our new compactification, 
and §7 surveys the arc operad and its relationship with the sphericity conjecture. A final 
appendix is intended to give a mathematical overview of folding problems in computa- 
tional biology, employs the arc complex constructions to give new combinatorial models 
for certain biopolymers, and ends with a speculative remark. 

1. Definitions and cell decomposition for punctured surfaces 

Consider a smooth surface F = FL 3 r of genus g > with r > labeled boundary compo- 
nents and s > labeled punctures, where 6g — 7 + 4r + 2s > 0. The surface F is said to 
be bordered if r ^ 0. The pure mapping class group PMC = PMC(F) of F is the group 
of all isotopy classes of orientation-preserving homeomorphisms of F which pointwise fix 
each boundary component and each puncture, where the isotopy is likewise required to 
pointwise fix these sets. 

In the unbordered case r = 0, define the classical Teichmiiller space Tg of F to be the 
space of all complete finite-area metrics of constant Gauss curvature —1 ("hyperbolic 
metrics") on F modulo push- forward by diffeomorphisms fixing each puncture which are 
isotopic to the identity relative to the punctures. Tg is homeomorphic to an open ball of 
real dimension 6g — 6 + 2s, the mapping class group PMC acts on Tg by push-forward 
of metric under representative diffeomorphism, and the quotient M s g = Tg/PMC is by 
definition Riemann's moduli space of F. 

In the unbordered punctured case, where r = and s > 0, there is a trivial R> -bundle 
Tg — > Tg, where the fiber over a point is the set of all s-tuples of horocycles in F, one 
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horocycle about each puncture, and there is a corresponding trivial bundle T g s /PMC = 

Mg — > Mg\ the total spaces of these bundles are the decorated Teichmuller and decorated 
moduli spaces, respectively, and they are studied in [P1-P3]. 

In the bordered case r^O, there are two geometric treatments of distinguished points in 
the boundary, which will be carefully described and compared in §3 leading to two closely 
related models of corresponding decorated moduli spaces. 

The remainder of this section is dedicated to a discussion of the well-known canonical cell 
decomposition of decorated moduli space in the punctured unbordered case r = with 
s > (as we tacitly assume in the remainder of this section), which may be described 
either in the spirit of [P1-P3] ("in the hyperbolic setting") or using quadratic differentials 
[St] ( "in the conformal setting" ) as in [Ha] or [Ko] . Our discussion begins in the conformal 
setting, where moduli space is regarded as the space of all equivalence classes of conformal 
structures under push-forward by any orientation-preserving diffeomorphism. 

A "fatgraph" or "ribbon graph" G is a graph whose vertices have valence at least three plus 
the further structure of a cyclic ordering on the half-edges about each vertex. A fatgraph 
G may be "fattened" to a bordered surface as follows: start with disjoint neighborhoods in 
the plane of the vertices of G, where the cyclic ordering is determined by the orientation 
of the plane; glue orientation-preserving bands to these neighborhoods in the natural way, 
one band for each edge of G, to yield a topological surface Fq 2 G with G a spine of 
Fq. The complement Fq — G is a union of topological annuli i 4 C F G) i = l,...,s>l, 
where each annulus has one boundary component d\ contained in G and the other <9j 
contained in 3Fq. 

Letting E(G) denote the set of edges of G, a "metric" on G is any function w G (R,> ) E ( G \ 
The curve d\ is a cycle on G, and we define the "length" of A, to be £i(w) = ^2 w ( e )? where 
the sum is over e G d\ counted with multiplicity. A metric w is said to be "positive" if 
£i(w) > for each i, and we let o~(G) denote the space of all positive metrics on G. 

Suppose that w G cr(G) is a positive metric with associated lengths £{ > 0, for % = 1, . . . , s. 
We construct a metric surface homeomorphic to F* as follows, where 2 — 2g — s is the 
Euler characteristic of G. Give each Ai = Ai(w) the structure of a flat cylinder with 
circumference £i and height unity, and isometrically identify these cylinders in the natural 
way along common edges in U{t^}f as dictated by the metric and fatgraph. This yields a 
metric structure on Fq so that the boundary component <9j of Fq is a standard circle of 
circumference if, for each % = 1, . . . , s, glue a standard flat disk of circumference £i to each 
di and take the center of the flat disk as a puncture *j to produce a conformal structure 
on a surface which is identified with F*. 

An analytic fact is: 

Theorem A [St;§23.5] Suppose w G o~(G) is a positive metric on the fatgraph G, and 
let £i = £i(w) denote the corresponding lengths, for i = 1, . . . , s. Then there is a unique 
meromorphic quadratic differential q on Fg so that for each i = 1, . . . , s, the non-critical 
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horizontal trajectories of q in F foliate A^w) C Fg by curves homotopic to the cores, and 
the residue of A /g at *i is £i . 

Let n q denote the conformal structure on Fg determined by the quadratic differential q. 
Theorem A thus describes an effective construction 

(G,«i)^^x(^er;xR s >0 . 

Fix a surface F = Fg, and consider the set C s g of all homotopy classes of inclusions of 
fatgraph G C F where G is a strong deformation retract of F. If G C F and w G o~(G), 
then we may produce another G w C F by contracting each edge e G F(G) with w(e) = 
to produce G w . Identifying E(G W ) with {e G F(G) : w(e) 7^ 0} in the natural way, we 
thus induce w' G cr(G w ) by requiring that w'(e) = w(e) whenever w(e) 7^ 0. 



Consider 
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where JJ denotes disjoint union, and (G 1 ,^ 1 ) ~ (G 2 ,w 2 ) if and only if G\ )i C F agrees 
with G 2 U2 C F as members of and «^ G cr(G 1 ) agrees with G a(G 2 ). PMC(F) acts 
on C7i induced by (0 : G — > F) 1— > (/ o : G — > F) for any diffeomorphism / : F — > F, 
and the quotient "fatgraph complex" is G s g = Ug/PMC; let [G,w] G denote the class 
of (G, w) G Ug. 

Theorem B [S;§25.6] and [HM] (G, w) 1— > fj, q x (£i)f induces real-analytic homeomorphisms 
U° -> T ff s x R^ and -> M ff s x R^ . 

By Theorem A, the mapping G s g — > Mf x RS^ is well-defined and injective, and by Theorem 
B, this mapping is surjective (but its inverse requires solving the Beltrami equation and is 
highly non-computable). There is thus a FMC-invariant cell decomposition of Tg x R> 
induced by the cell structure of G s g , which is the main result in the conformal setting 
positing the existence of a cell decomposition of M s g . 

In the hyperbolic setting, fix a (conjugacy class of) Fuchsian group V uniformizing a point 
of Tg, and specify a collection of horocycles, one horocycle about each puncture of F = Fg 

(called a "decoration"); these data uniquely determine a point T G T g s by definition. The 
main ingredient for the cell decomposition of moduli space in the hyperbolic setting is 
the "convex hull construction" [PI; §4], which assigns to V G T g s a corresponding point 
(Gp, wp) G Ug as follows: 

Via affine duality, we may identify the open positive light-cone L + in Minkowski three- 
space with the collection of all horocycles in the hyperbolic plane (cf. §2) and thus produce 
a T-invariant set B C L + corresponding to the decoration, where we regard V as acting via 
Minkowski isometries. In fact, B is discrete in L + , and we may consider the closed convex 
hull of B in the vector space structure underlying Minkowski space. The extreme edges of 
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the resulting T-invariant convex body project to a collection of disjointly embedded arcs 
ctp connecting punctures, and each component of F — Uojf is simply connected; we shall 
say that such an arc family "fills" F. Given any arc family a filling F, we may define 
a subset 

C(a) = {r e Tg : CKp is homotopic to a}. 

The Poincare dual of the cell decomposition Ua r of F is a fatgraph G embedded as a spine 
of F. The explicit formula in terms of Minkowski geometry for the "simplicial coordinate" 
of an edge of G, gives a positive metric w-p on G; the simplicial coordinates are effectively 
the signed volume of the corresponding simplex with light-like vertices, as we shall describe 
in §2. 

In the conformal setting, the effective construction maps G s g — > {M* x R> ), while in 
the hyperbolic setting, the effective construction (i.e., the convex hull construction) maps 
Mg — > G s g in the opposite direction! Thus, each of the hyperbolic and conformal treatments 
has its difficult theorem: surjectivity of the effective construction. The proof of surjectivity 
in decorated Teichmiiller theory devolves to showing that the putative cells C(a) C T g s are 
in fact cells, and this cellularity is proven in [PI; §5] (independent of the Jenkins- Strebel 
theory) as described further in §2. 

Just as the conformal setting has a non-computable inverse (Mg x R> ) — > G s g , there is a 
non-computable (or at least, very difficult to compute) inverse G s g — > M g in the hyperbolic 
setting. These "arithmetic problems" are studied in [P2] and will be discussed in §2; a 
solution to the arithmetic problem is thus a solution to an appropriate Beltrami equation. 

We believe that one may combine bounded distortion results of Epstein-Marden/Sullivan 
from the 1980's with arguments from [PI, §6] and prove that the hyperbolic construction 
followed by the conformal construction has bounded distortion on open moduli space (and 
the details of this will appear elsewhere). For instance, one can thus view the arithmetic 
problem as the calculation of hyperbolic geometry from conformal combinatorics, up to a 
bounded error. 

It is worth emphasizing that the conformal and hyperbolic versions of the cell decomposi- 
tion of Tg agree combinatorially but not pointwise. Since PMC acts more or less discretely 
on T* it is not surprising that a given cell decomposition can be "jiggled" to produce a 
combinatorially equivalent but pointwise distinct cell decomposition. (On the other hand, 
another hyperbolic version of the cell decomposition due to Bowditch-Epstein was shown 
in [P2] to agree pointwise with the cell decomposition based on the convex hull construc- 
tion.) Finally, there is further structure in the hyperbolic setting without analogue in the 
conformal setting as discussed in §2-3 (from [PI] in the punctured and [P4] in the bordered 
case) . 

2. Coordinates on Teichmiiller space for punctured surfaces 

There are several coordinatizations of the Teichmiiller space T g s = T(F g ) of a punctured 
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unbordered surface which we shall next describe. All depend upon the Minkowski inner 
product < •, • > on R 3 whose quadratic form is given by x 2 +y 2 —z 2 in the usual coordinates. 
As is well-known, the upper sheet H = {u = (x, y, z) G R 3 :< u, u >= —1 and z > 0} of 
the two-sheeted hyperboloid is isometric to the hyperbolic plane. Furthermore, the open 
positive light cone L + = {u = (x, y, z) e R 3 :< it, u >= and z > 0} is identified with the 
collection of all horocycles in H via the correspondence u i— > h{u) = {w G H :< w,u >= 

-!}• 

The first basic building block to be understood is a "decorated geodesic", by which we 
mean a pair of horocycles ho, hi in the hyperbolic plane with distinct centers, so there 
is a well-defined geodesic connecting the centers of ho and hi; the two horocycles may 
or may not be disjoint, and there is a well-defined signed hyperbolic distance 5 between 
the horocycles (taken to be positive if and only if ho D hi = 0) as illustrated in the 
two cases of Figure 1. The lambda length of the pair of horocycles {ho, hi} is defined 
to be the transform X(ho,hi) = \/2 exp 6. Taking this particular transform renders the 
identification h geometrically natural in the sense that X(h(uo), h(ui)) = ^J— < uq, u[ >, 
for uq, ui E L+ as one can check. 




Figure 1 Decorated geodesies. 

By an arc family in F*, we mean the isotopy class of a family of essential arcs disjointly 
embedded in Fg connecting punctures, where no two arcs in a family may be homotopic 
rel punctures. If a is an arc family so that each component of F — Ua is a polygon, then we 
say that a fills F, and in the extreme case that a is maximal so that each complementary 
component is a triangle, then a is called an ideal triangulation. 

Let us choose an ideal triangulation r of Fg once and for all and regard this as a "choice 
of basis" for the first global coordinatization: 

Theorem 1 [PI; Theorem 3.1] For any ideal triangulation t of F , the assignment of 
lambda lengths defines a real-analytic homeomorphism 

l g — > rt >0 . 
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For the proof, we refer the reader to [PI], which gives an effective construction of the 
underlying Fuchsian group from the lambda lengths on r (and is the decorated version of 
an easy case of Poincare's original construction of Fuchsian groups). 

The next basic building block to be understood is a "decorated triangle" , by which we mean 
a triple ho, hi, hi of horocycles with distinct centers, so these centers determine an ideal 
triangle in the hyperbolic plane. We shall refer to a neighborhood of a cusp of this triangle 
as a "sector" of the triangle, so there is one horocylic segment contained in each sector, and 
we define the h-length of the sector to be half the hyperbolic length of the corresponding 
horocyclic segment; see Figure 2, where we have identified an arc with its lambda length 
for convenience. One can think of the h-length as a kind of angle measurement between 
geodesies asymptotic to a puncture. . 




Figure 2 Decorated triangle. 

Lemma 2 Fix a decorated ideal triangle ho,hi,h2, and define the lambda lengths Xi = 
\{hj,hk), where k} = {0, 1, 2}. Then 

a) [Pl;Proposition 2.8] As in Figure 2, the h-length of the sector opposite Xi is given by 
jfc for {i,j,k} = {0,1,2}. 

b) [P2;Propostion 2.3] There is a point in the hyperbolic plane equidistant to ho, hi, h 2 if 
and only ifXo, Xi, X2 satisfy all three possible strict triangle inequalities, and this equidistant 
point is unique in this case as illustrated in Figure 3; furthermore, for a fixed ideal triangle, 
all points in the hyperbolic plane arise for some decoration on it. 

c) [P5;WP volumes; Theorem 3.3.6] The Weil-Petersson Kdhler two-form is given by 

—2 ^2 dlog Xo A dlog Xi + dlog Xi A cflog A2 + cflog A2 A cflog Ao, 

where the sum is over all triangles complementary to r with edges Xo, Xi, A2 in this cyclic 
order in accordance with the orientation of F. 
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The proofs of a) and b) follow from direct calculation, and c) follows from Wolpert's 
formula plus calculation. 




inside triangle outside triangle 

Figure 3 Equidistant points to triples of horocycles. 



The last basic building block to be understood is a "decorated quadrilateral" by which we 
mean four horocycles with distinct centers, which thus determine an ideal quadrilateral. 
Let us furthermore triangulate this quadrilateral by adding one of its diagonals e and adopt 
the notation of Figure 4 for the nearby lambda lengths, where again we identify an arc 
with its lambda length for convenience. Furthermore, adopt the notation of Figure 4 for 
the nearby h-lengths. Define the simplicial coordinate of the edge e to be 

_ a 2 + b 2 - e 2 c 2 + <P - e 2 . a , . . ^ 
E= abe + ale = + + (7 + <-«- 

In fact, abcdE is the signed volume of the corresponding tetrahedron in Minkowski space, 
as one can check. 




Figure 4 Decorated quadrilateral. 
9 



Lemma 3 [Pl:§2] Adopting the notation of Figure 4, we have 



a) the coupling equation holds: aj3 = 7$; 

b) the cross-ratio of the four ideal vertices is given by 

c) the diagonal f of the quadrilateral other than diagonal e has lambda length given by a 
Ptolemy transformation; / = ac + bd , 

The proofs follow from easy calculation. The coupling equations give a variety determined 
by a collection of coupled quadric equations in the vector space spanned by the sectors, 
where the simpicial coordinates are given by linear constraints, and this variety is identified 
with decorated Teichmiiller space. 

Fix an ideal triangulation r of F* Define a cycle of triangles to be a collection 

of triangles complementary to r so that tj n tj+i = e j7 for each j = 1, ... ,71, and the 
index j is cyclic (so that t n+ i = t\). If the edges of tj are {ej, ej-i, bj}, for j = 1, . . . , n, 
then the collection {bj}™ is called the boundary of the cycle, and the edges {ej} are called 
the consecutive edges of the cycle. A function X : {edges of r} — > R>o satisfies the 
no vanishing cycle condition if for any cycle of triangles with consecutive edges e n+ i = 
ei, . . . , e n , we have YTj=i x ( e j) > °- 

A nifty little fact is the "telescoping property" of simplicial coordinates, as follows: If 
e±, . . . , e n are the consecutive edges of a cycle of triangles with boundary b±, . . . ,b n , and 
the sector in ti opposite bi has h- length hi, then Y^i=i Ei = 2 hi, where the simplicial 
coordinate of edge ej is given by Ei. The proof follows immediately from the definition of 
simplicial coordinates. 

Suppose that F e Tg gives rise via the convex hull construction to the ideal cell decom- 
position ctp. Let us complete ctp to an ideal triangulation r in any manner (triangulating 
the polygonal complementary regions). From the very definition of the convex hull con- 
struction and simplicial coordinate, we find that the simplicial coordinate of each edge is 
non-negative (by convexity) and satisfies the no vanishing cycle condition (since comple- 
mentary regions are polygonal). 

Conversely, the main hard result of decorated Teichmiiller theory is 

Theorem 4 [Pl;Theorem 5. 4] Given an ideal triangulation r and any X e e R>o, for each 
e G t, with no vanishing cycles, there is a unique corresponding assignment of lambda 
lengths in R> determining a decorated hyperbolic structure T G T g s so that ct r = r — {e G 
t : X e = 0}, and the lambda lengths on r realize the putative simplicial coordinates X e . 

The idea of the proof is to consider the vector space spanned by the sectors of an ideal 
triangulation r and introduce the "energy" 
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where the sum is over all edges e of r in the notation of Figure 4. In fact, solving for 
lambda lengths amounts to minimizing K subject to the constraints given by the simplicial 
coordinates, the constrained gradient flow of K is dissipative with a unique minimum, this 
unique constrained minimum of K solves the coupling equations, and this gives the desired 
lambda lengths from a/3 = e~ 2 = 78. 

Thus, the explicit formulas for simplicial coordinates in terms of lambda lengths can be 
uniquely inverted provided the simplicial coordinates are non-negative and satisfy the no 
vanishing cycle condition. This explicit calculation is called the 

Arithmetic Problem [P2] Fix any ideal triangulation of any punctured surface, and 
calculate the lambda lengths from the simplicial coordinates. 

Lemma 5 [Pl;Lemma 5.2] The non-vanishing cycle condition on simplicial coordinates im- 
plies that the lambda lengths on any complementary triangle satisfy all three strict triangle 
inequalities; in particular, the product of the lambda length and the simplicial coordinate 
of an edge is bounded above by four. 

Proof Adopt the notation in the definition of simplicial coordinates for the lambda lengths 
near an edge e. If c+d < e, then c 2 + d 2 — e 2 < —2cd, so the non-negativity of the simplicial 
coordinate E gives < cd[(a — 6) 2 — e 2 ], and we find a second edge-triangle pair so that 
the triangle inequality fails. This is a basic algebraic fact about simplicial coordinates. It 
follows that if there is any such triangle t so that the triangle inequalities do fail for t, 
then there must be a cycle of triangles of such failures. This possibility is untenable since 
if Cj+i > bj + ej, for j = 1, . . .n, where we again identify an arc with its lambda length, 
then upon summing and canceling like terms, we find > ^7=1 bj, which is absurd since 
lambda lengths are positive. 

The second part follows from the elementary observation in the notation of Figure 4 that 

a 2 + b 2 - e 2 c 2 + d 2 - e 2 
eE = h 



ab cd 

i.e., each summand is twice a cosine, and hence eE < 4. q.e.d. 

By taking Poincare duals of filling arc families, there is established an isomorphism between 
fatgraphs and ideal cell decompositions, and the simplicial coordinates are the analogues 
of the metric lengths of edges in the conformal treatment. Indeed, we may restate the 
main theorem of §1 in the hyperbolic setting using arc families as: 

Theorem 6 [Pl;Theorem 5.5] There is a PMC(Fg) -invariant cell decomposition of T g s 
isomorphic to the geometric realization of the partially ordered set of all arc families filling 
F s 

9 ' 
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3. Bordered surfaces 

We next turn our attention to bordered surfaces. Not only does much of the decorated 
Teichmiiller theory extend naturally to this setting, but there are also new aspects of the 
theory to be described. 

In this section, we shall assume only that r ^ (and allow both cases s = or s > 0). 
Enumerate the (smooth) boundary components of F as <9j, where % = l,...,r, and set 
d = U{d i }\. 

Let Hyp(F) be the space of all hyperbolic metrics with geodesic boundary on F. Define 
the moduli space to be 

r 

M = M(F) = [Hyp(F) x ([[di)]/ ^ 

l 

where ~ is the equivalence relation generated by push-forward of metric under orientation- 
preserving diffeomorphism 

/*(r,(6)I)) = (/*(r),(/(e 4 )l), 

where V i— > /*(r) is the usual push-forward of metric on Hyp(F). 

Let £i(T) denote the hyperbolic length of di for V G Hyp(F), and define the first model 
M = M(F) of decorated moduli space to be 

{ (r, (6)1, (UK) : r G Hyp(F),£i G di, < U < £i(T), i = 1, . . . , r}/ ~, 

where ~ denotes push- forward by diffeomorphisms on (r, as before, extended by 

the trivial action on (ti)i- A point of M is thus represented by V e Hyp(F) together with 
a pair of points £j 7^ in each component di, where pi is the point at hyperbolic distance 
ti along di from £j in the natural orientation. (In the special case g = = s = r — 2 where 
F is an annulus, we define M(F) to be the collection of all configurations of two distinct 
labeled points in an abstract circle of some radius.) 

Turning to the second model for decorated moduli space, begin with a smooth surface F 
with smooth boundary, choose one distinguished point di G di in each boundary compo- 
nent, and set D = {di}\. Define a quasi hyperbolic metric on F to be a hyperbolic metric 
on F x = F — D so that df = di — {di} is totally geodesic, for each i = 1, . . . , r, and set 
d x = U{<9j X }]\ To illuminate the definition, take a hyperbolic metric on a once-punctured 
annulus A with geodesic boundary and the simple essential geodesic arc a in A which is 
asymptotic in both directions to the puncture; the induced metric on a component of A — a 
gives a model for the quasi hyperbolic metric near a component of d x as in Figure 5. 
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Figure 5 The model for F x . 

In this second model, there is a version of decorated Teichmuller space as the space T = 
T(F) of all quasi hyperbolic metrics on F — D, where we furthermore specify for each di 
a horocyclic segment centered at di (again called a "decoration"), modulo push- forward 
by diffeomorphisms of F — D which are isotopic to the identity, where the action is trivial 
on hyperbolic lengths of horocylic segments. In fact, T is homeomorphic to an open 
ball of_dimension 6g — 6 + 5r + 2s. The second model for the decorated moduli space is 
M = M(F) = T{F) /PMC(F) . 

For a fixed quasi hyperbolic metric on F, there is a unique separating geodesic d* C F in 
the homotopy class of di . Excising from F — U{<9*}^ the components containing points 
of <9 X yields a surface F* , which inherits a hyperbolic metric with geodesic boundary. (In 
the special case that F is an annulus, the surface F* collapses to a circle.) T G T has its 
underlying hyperbolic metric given by a conjugacy class of Fuchsian group V for F*. 

Define an arc in F to be a smooth path a embedded in F whose endpoints lie in D and 
which meets dF transversely, where a is not isotopic rel its endpoints to a path lying in 
dF — D. Two arcs are parallel if there is an isotopy between them fixing D pointwise. 
An arc family in F is the isotopy class of a set of disjointly embedded arcs in F, no two 
components of which are parallel. A collection a of arcs representing an arc family in F 
is said to meet efficiently an arc or curve C embedded in F if there are no bigons in F 
complementary to a U C. 

If a is an arc family in F where each component of F — Ua is a polygon or a once-punctured 
polygon, then we say that a quasi fills F. In the extreme case that each component is 
either a triangle or a once-punctured monogon, then a is called a quasi triangulation. 

Theorem 7 [P4;Theorem 1] For any quasi triangulation r of F, the assignment of lambda 
lengths defines a real-analytic homeomorphism 

Prs r tU9 x 
2 g,r ~^ rt >0 



The proof is verbatim that of [PI; Theorem 3.1] briefly mentioned before. In fact, much 
of the number-theoretic utility of lambda lengths in the punctured case [P1-P3] extends 
without great difficulty to the bordered case as discussed in further detail in the Side- 
Remark after Theorem 1 of [P4]. 

One introduces simplicial coordinates in the bordered case in analogy to the previous 
discussion, where if e G <9 X , then it bounds a triangle on only one side, say with edges 

2 4-h 2 — 2 

a, b, e, and we define its simplicial coordinate to be E = 2 a + , — — . 
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Fix a quasi triangulation r of F, and define the subspace 

C(r) = {(y, x) G R x (R> ) T : there are no vanishing cycles or arcs} 



where the coordinate functions are the simplicial coordinates (rather than the lambda 
lengths as in the previous theorem). We say that there are "no vanishing cycles" as 
before for a point of C(r) if there is no essential simple closed curve C C F meeting a 
representative r efficiently so that 

= ^ x p, 

peCnur 

where p G C fl a, for a G r, contributes to the sum the coordinate x p of a. We say that 
there are "no vanishing arcs" for a point of C(r) if there is no essential simple arc ACF 
meeting r efficiently and properly embedded in F with endpoints disjoint from D so that 

peAnd x peArir 

where again x p and y v denote the coordinate on a at an intersection point p = A fl a or 
p = C H a for a G r. This is a convex constraint on y for fixed 

Just as in the case of punctured unbordered surfaces, V gives rise to a quasi filling arc 
family a r via the convex hull construction; fixing the topological type of ctf and varying 

T then gives a cell in a natural decomposition of T. 

Theorem 8 [P4; Theorem 3] A point Y G T gives rise to a gwasi filling arc family ctf 
via the convex hull construction as well as a tuple of simplicial coordinates (y, x) G C(r) 
for any quasi triangulation r 3 «p, where x vanishes onr — a^. Furthermore, this gives a 
real-analytic homeomorphism of the decorated Teichmuller space of F with [Ur^( r )]/ ~> 
where (ti, yi, x-\) ~ (t2, ^2, ^2) ^1 agrees with x^ on t\ — {a G Ti : x 1 = 0} = T2 — {a G 
T2 : x 2 = 0}, where x J denotes the x coordinate on a, for j = 1, 2, and y± = i/2- 

Define a variant of the decorated Teichmuller space 

T = T(F)=[)JC(t)}/~, 

T 

where ~ is as in Theorem 8, and for membership in C{r) we demand that there are no 
vanishing cycles or arcs and for any triangle t C F complementary to r, and the lambda 
lengths on the edges of t satisfy all three possible strict triangle inequalities. Using cycles 
of triangles, the quotient M. = T/PMC can be shown [P4;Lemma 5] to be a strong 
deformation retract of M. Using the existence of equidistant points to suitable triples of 
horocycles as in Lemma 2b in order to determine distinguished points, one can then prove 
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Theorem 9 [P4; Theorem 8] There is a real-analytic homeomorphism between M. and M . 

Thus, our two models M and M. of decorated moduli space are homotopy equivalent (and 
indeed are homeomorphic except for the required passage to M. C M). 

4. The arc complex of a bordered surface 

Let us inductively build a simplicial complex Arc'(F), where there is one p-simplex a (a) 
for each arc family a in F of cardinality p+1; the simplicial structure of a(a) is the natural 
one, where faces of a(a) correspond to sub arc families of a. Identifying the open standard 
p-simplex with the collection of all real- projective (p + l)-tuples of positive reals assigned 
to the vertices, Arc'(F) is identified with the collection of all arc families in F together 
with a real-project ive weighting of non-negative real numbers, one such number assigned 
to each component of a. 

PMC(F) acts on Arc'(F) in the natural way, and we define the arc complex of F to be 
the quotient 

Arc(F) = Arc' '(F)/ PMC '(F). 

If a is an arc family in F with corresponding simplex a (a) in Arc'(F), then we shall let 
[a] denote the PMC(F)-orbit of a and o~[a] denote the quotient of a (a) in Arc(F). A 
subspace of Arc(F) of special interest is the space of quasi-filling arc families with positive 
projective weight 

Arc#(F) = {a [a] : a quasi — fills F}. 

Let M(F)/R > o denote the quotient of the moduli space of the bordered surface F by 
the R>o-action by homothety on the tuple of hyperbolic lengths of the geodesic boundary 
components. A further analysis of the homeomorphism in Theorem 9 shows 

Theorem 10 [P4;Theorem 14] For any bordered surface F ^ Fq 2 , Arc#(F) is proper 
homotopy equivalent to M(F)/H > q. 

Furthermore, each of M(F)/R >0 and Arc#(F) admit natural (S' 1 ) r -actions (moving dis- 
tinguished points in the boundaries for the former and "twisting" arc families around the 
boundary components for the latter); the proper homotopy equivalence in Theorem 10 is in 
fact a map of (5' 1 ) r -spaces. This is the main result of [P4]. It follows from Theorem 10 plus 
the sphericity conjecture 12 discussed in the next section that M(F)/R>o compactifies to 
a sphere. 

Notice that the space of all projectivized laminations VC(F) is itself a sphere (cf. [PH]) 
which in particular contains the complex Arc'(F). It is well-known that VC(F) / PMC(F) 
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is a space whose largest Hausdorff quotient is a singleton, yet it contains as a dense open 
set the quotient Arc(F), whose further study we undertake in the next section. 



5. Sphericity 

Extending the notation from before, let F = F s - denote a fixed smooth surface with 

genus g > 0, s > punctures, and r > 1 boundary components, where the i th boundary 
component comes equipped with Si > 1 distinguished points di as well, for i = 1, . . . , Si 
and 6 = (Si, 62, ■ ■ ■ S r ). 

Let Arc(F) = Arc' (F)/ PMC (F) denote the arc complex of F as defined before, and recall 
the following combinatorial fact. 

Classical Fact 11 For F = Ffj ^ n y Arc(F) = Arc#(F) = Arc'(F) is PL-homeomorphic 
to the sphere of dimension n — 4 provided n > 4. 

Gian-Carlo Rota told us that sphericity for polygons was almost certainly known to Whit- 
ney. This is a special case of the following general conjecture. 

Sphericity Conjecture 12 [P3;Conj.B] The arc complex Arc(F) is PL-homeomorphic 
to the sphere of dimension 6g — 7 + 3r + 2s + Si + 62 + ■ ■ ■ + S r provided this number is 
non-negative. 

This result has been a long-standing goal for us. We had proved this conjecture for g = 
= r — 1 in [P3] and shall recall this proof and add all the details here. New tools have 
arisen to address the general sphericity conjecture in collaboration with Dennis Sullivan. 

Let 

D = D{F) = U{di:i = l,2,...,r} 
denote the set of all the distinguished points in the boundary of F, 

A = A(F) =d 1 +S 2 + --- + S r , 

be the cardinality of D, and set 

N = N(F) = 6g - 7 + 3r + 2s + A 

to be the dimension of the arc complex. Furthermore, in the special case that A = r so 
6 is an r-dimensional vector each of whose entries is unity, then we shall also sometimes 
write Fg r = F s Notice that N > except for the triangle ^0 (3) an< ^ once-punctured 

monogon F^ = F^ iy 

It is convenient in the sequel to imagine weighted arcs in the spirit of train tracks [PH] as 
follows. If a is an arc family, then a weighting on a is the assignment of a non-negative 
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real number, the weight, to each arc comprising a, and the weighting is positive if each 
weight is positive. A projective weighting is the projective class of a weighting on an arc 
family. Since the points of a simplex are described by "projective non-negative weightings 
on its vertices", we may identify a point of a(a) with a projective weighting on a. The 
projective positive weightings correspond to the interior Int a(a) of cr(a), and Arc'(F) 
itself is thus identified with the collection of all projective positively weighted arc families 
in F. 

Suppose that a = {ao, . . . , a p } is an arc family. A useful realization of a positive weighting 
w = (wi)o on the respective components of a is as a collection of p+1 "bands" (3i disjointly 
embedded in the interior of F with the width of $ given by Wi, where cii traverses the length 
of Pi, for i = 0, . . . ,p. The bands coalesce near the boundary of F and are attached one- 
to-the-next in the natural manner to form "one big band" for each of the A distinguished 
points as illustrated in Figure 6. (More precisely, we are constructing here a "partial 
measured foliation from a weighted arc family regarded as a train track with stops"; for 
further details, see [PH] or [KLP;§1].) 




Figure 6 The band picture of a weighted arc family. 

Given an arc family a in F, consider the surface F — Ua. The non-smooth points or 
cusps on the frontiers of the components of F — Ua give rise to distinguished points in the 
boundary of F — Ua in the natural way, and this surface together with these distinguished 
points is denoted F a . Our basic approach to sphericity is by induction on N(F) > N(F a ), 
and more will subsequently be said on this point. 

We begin the discussion of sphericity with a number of examples. 

Example 1 If g = s = and r = 1, then F is a polygon with 6\ vertices, N = Si — 4, 
PMC(F) is trivial, and we are in the setting of the Classical Fact. When 8\ = 4 with 
F a quadrilateral, we have A = 0, and there are exactly two chords, which cannot be 
simultaneously disjointly embedded. Thus, Arc(F) = Arc'(F) consists of two vertices, 
namely, Arc(F) is the 0-sphere. (The reader may likewise directly investigate the case 
A = 1 where F is a pentagon, and the usual "pentagon relation" shows that Arc(F) 
itself is also a pentagon.) 
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More generally, in order to handle all polygons inductively, we recall 

Lemma 13 If F' arises from F by specifying one extra distinguished point in the boundary 
of F , then Arc(F') is PL-homeomorphic to the suspension ofArc(F) provided Arc(F) ^ 0. 

Proof If p is the extra distinguished point of F' in the boundary of F, let po, Pi,p, P2 be the 
consecutive distinguished points lying in a single boundary component of F' in its induced 
orientation, where the pj may not be distinct. There is a natural mapping of Arc'(F) x [0, 1] 
to Arc'(F') using partial measured foliations gotten by sliding proportion t G [0, 1] of the 
band that hits p\ over to p, which descends to an embedding of (Arc(F) x [0, 1])/ ~) to 
Arc(F'); the equivalence relation ~ collapses the interval [0, 1] to a point for any arc family 
that does not hit p\. There is an arc a\ near the boundary which connects p\ to pi and 
another arc ao near the boundary which connects po and p, each of which is essential in 
F' but either non-essential or non-existent in F. 

The surface F^ a ^ has two components, one of which is a triangle, and the other of which 
is homeomorphic to F. The collection Ci of projectively weighted arc families in F' which 
either contain or to which we may add a\ is thus homeomorphic to the topological join 
d [a,i] * Arc(F) of Arc(F) with a point, namely, the vertex a[{ai}] corresponding to a^, 
for i = 0,1. 

Identifying the copy of Arc(F) lying in Ci with 

[(Arc(F) x {%})/ ~] C [(Arc(F) x {0, 1})/ ~] 

in the natural way for % = 0, 1, we find a mapping of the suspension of Arc(F) onto Arc(F') 
which is evidently a PL-homeomorphism. q.e.d. 

Proof of the Classical Fact Induct on the number 81 > 4 of distinguished points using 
Lemma 13 starting from the basis step 81 = 4 treated in Example 1 (using the fact that 
the suspension of a PL-sphere is again a PL-sphere of one greater dimension). q.e.d. 

Example 2 If g = 0, r = s = 1, and 8± = 2, then F is a once-punctured bigon. Any arc 
in F with distinct endpoints must be inessential, and any arc whose endpoints coincide 
must be separating. There are two complementary components to a, one of which has 
one cusp in its frontier, and the other of which has two cusps. The component of the 
former type must contain the puncture and of the latter type must be a triangle. It 
follows that Arc(F) = Arc'(F) is in this case again a zero-dimensional sphere, and 
Lemma 13 applies as before to prove sphericity whenever g = and r = s = 1. 

As in this example, if a is a separating arc whose endpoints coincide with the point p in 
any surface, then its two complementary components are distinguished by the fact that 
one component has exactly one cusp arising from p, and the other component has exactly 
two cusps arising from p, and we shall refer to these respective complementary regions as 
the "one-cusped" and "two-cusped" components. In particular, any arc in a planar surface 
whose endpoints coincide is necessarily separating. 
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Example 3 If g = s = 0, r = 2, and 5\ = 5 2 = 1, then F is an annulus with one 
distinguished point on each boundary component. It is elementary that any essential 
arc must have distinct endpoints and its isotopy class is classified by an integral "twist- 
ing number", namely, the number of times it twists around the core of the annulus; 
furthermore, two non-parallel essential arcs can be disjointly embedded in F if and 
only if their twisting numbers differ by one. Thus, Arc'(F) is in this case isomorphic 
to the real line, the mapping class group PMC(F) is infinite cyclic and generated by 
the Dehn twist along the core of the annulus, which acts by translation by one on the 
real line, and the quotient is a circle. Again, Lemma 13 extends this to any surface 
with g = s = and r = 2. 

Example 4 If g = 0, s = 2, and r = 1 = 8\, then F is a twice-punctured monogon. If 
a is an essential arc in F, then its one-cusped component contains a single puncture, 
which may be either of the two punctures in F, and the two-cusped component contains 
the other puncture of F. Labeling the punctures of F by and 1, choose disjointly 
embedded arcs a« so that ai contains puncture i in its one-cusped complementary 
component, for i = 0, 1. The Dehn twist r along the boundary generates PMC{F), 
and the 0-skeleton of Arc'(F) can again be identified with the integers, where the vertex 
o"({r J (ai)}) is identified with the integer 2j + i for any integer j and i = 0, 1. 

Applying Example 2 to the two-cusped component of F{ a .y, for i = 0, 1, we find exactly 
two one-simplices incident on a({cii}), namely, there are two ways to add an arc disjoint 
from a\ corresponding to the two cusps in the two-cusped component, and these arcs 
are identified with the integers i±l. Thus, Arc'(F) is again identified with the real line, 
PMC(F) acts this time as translation by two, and Arc(F) is again a circle. Lemma 13 
then proves sphericity of Arc(F) for any planar surface with r = 1 and s = 2 

The typical aspect illustrated in this example is that the arcs in an arc family come in 
a natural linear ordering which is invariant under the action of the pure mapping class 
group. Specifically, enumerate once and for all the distinguished points pi,P2, ■ ■ ■ ,Pa in 
the boundary of F in any manner. The boundary of a regular neighborhood of pi in F, 
for % = 1, 2, . . . , A contains an arc A^ in the interior of F, and Ai comes equipped with a 
natural orientation (lying in the boundary of the component of F — A± which contains pi). 
If a is an arc family in F, then there is a first intersection of a component of a with the Ai 
of least index, and this is the first arc a\ in the linear ordering. By induction, the (j + l) st 
arc (if such there be) in the putative linear ordering is the first intersection (if any) of a 
component of a — {ai, . . . , cij-i} with the Ai of least index. 

Since we take the pure mapping class group which fixes each distinguished point and 
preserves the orientation of F, this linear ordering descends to a well-defined linear or- 
dering on the PMC(-F)-orbits in Arc'(F). It follows immediately that the quotient map 
Arc'(F) — * Arc(F) is injective on the interior of any simplex in Arc'(F), so that the 
simplicial complex Arc'(F) descends to a CW decomposition on Arc(F). 

Furthermore, in the action of PMC(F) on Arc'(F), there can be no finite isotropy, and the 
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isotropy subgroup of a simplex a (a) in PMC(F) is either trivial or infinite. The former 
case occurs if and only if a quasi fills the surface F. 

One fundamental inductive tool is: 

Proposition 14 [P3;Lemma 3}Fix a bordered surface F , and suppose that Arc(F a ) is PL- 
homeomorphic to a sphere of dimension N(F a ) for each non-empty arc family a in F . 
Then Arc(F) is an N '(F) -dimensional PL-manifold. 

Idea of Proof The idea is that the link of a simplex a[a] in Arc(F) in the first barycentric 
subdivision is canonically isomorphic to Arc(F a ); this is a general fact about geometric 
realizations of partially ordered sets. Thus, any point x E Arc(F) admits a neighborhood 
which is PL-homeomorphic to an open ball of dimension N = N(F). Of course, x lies in 
the interior Int a[a] of cr[a], for some arc family a in F, and we suppose that a is comprised 
of p + 1 component arcs. We have the identity N + 1 = (N(F a ) + 1) + (p + 1) since both 
sides of the equation give the number of arcs in a quasi-triangulation of F. Since Arc(F a ) 
is PL-homeomorphic to an A^(F a )-dimensional sphere by hypothesis, Int a [a] * Arc(F a ) 
is PL-homeomorphic to an open ball of dimension p + N(F a ) + 1 = N. This gives the 
required neighborhood of x in Arc(F) and proves that Arc(F) is inductively a manifold. 
q.e.d. 

As was mentioned before, our basic approach to sphericity is inductive, and the proof 
depends upon the inductive hypothesis that the sphericity conjecture holds for F a for any 
non-trivial arc family a in F, i.e., by induction on N(F). In order to avoid the Poincare 
Conjecture in dimensions less than four, our basis step involves an explicit analysis of all 
arc complexes of dimension at most four. 

In the special case g = = r — lof multiply punctured polygons, we have already ana- 
lyzed in the examples above (in combination with Lemma 13) all of the arc complexes of 
dimension at most two. In this case, the only remaining complex in dimension three is for 
the surface Fq x and in dimension four is for Fq ^ , which follows from F = Fq x by Lemma 
13. 

To handle the arc complexes of F = Fq l5 for s > 3, label the various punctures of F 
with distinct members of the set S = {1, 2, . . . , s}. If a is an arc family in F and a is a 
component arc of a, then a has a corresponding one-cusped component containing some 
collection of punctures labeled by a proper non-empty subset S(a) C S, which we regard 
as the "label" of a itself. 

More generally, define a "tableaux" r labeled by 5 to be a rooted tree embedded in the 
plane where: the (not necessarily univalent) root of r is an unlabeled vertex, and the other 
vertices of r are labeled by proper non-empty subsets of S; for any n > 1, the vertices of 
r at distance n from the root are pairwise disjoint subsets of S; and, if a simple path in 
r from the root passes consecutively through the vertices labeled Si and £2, then $2 is a 
proper subset of Si. 
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Given an arc family a in F, inductively define the corresponding tableaux r = r(a) in 
the natural way: For the basis step, choose as root some point in the component of F a 
which contains the boundary of F; for the inductive step, given a vertex of r lying in a 
complementary region R of F a , enumerate the component arcs ao, ai, . . . , a m of a in the 
frontier of R, where we assume that these arcs occur in this order in the canonical linear 
ordering described in §2 and ciq separates R from the root. Each arc a, separates R from 
another component Ri of F a , and we adjoin to r one vertex in each such component Ri with 
the label S{aj) together with a one-simplex connecting R to Ri, for each i = 1,2, ... ,m, 
where the one-simplices are disjointly embedded in F. 

It follows immediately from the topological classification of surfaces that PMC(F)-orbits 
of cells in Arc(F) are in one-to-one correspondence with isomorphism class of tableaux 
labeled by S. Furthermore, since the edges of r(a) are in one-to-one correspondence with 
the component arcs of a, a point in Int(a(a)) is uniquely determined by a projective 
positive weight on the edges of r(a). It follows that Arc(F) itself is identified with the 
collection of all such projective weightings on all isomorphism classes of tableaux labeled 



by S. 




k 




Figure 7 The tableaux for Fq ±. 
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Example 5 The surface Fq X . Let us adopt the convention that given an ordered pair 
ij, where i,j G {1, 2, 3}, we shall let k = k(i,j) = {1, 2, 3} — {i, j}, so k actually depends 
only upon the unordered pair i,j. The various tableaux for Fq 1 are enumerated, 
labeled, and indexed in Figure 7, where in each case, ij varies over all ordered pairs 
of distinct members of {1, 2, 3}, k = 1, 2, 3, and the bullet represents the root. In this 
notation and letting d denote the boundary mapping in Arc = Arc(FQ 1 ), one may 
directly compute incidences of cells summarized as follows. 



dCij — Ai Aj, 
dD i:i = Aj - B k , 
dE k = A k - B k , 
9F k = B k — A k , 

9Gij Gij ^ ji ^iji 



dH. 



Dij - C 



jk 



F h 



dlij = — E k + Ckj, 

dJij — Cjj C{ k -\- Cj k f 

9Kij Gij Fl{j ~\~ Hji J ij , 



dLij — Iij — Gij + J] 



'hi 



Iji- 



We may symmetrize and define sub complexes X k = Xij U Xji, for X = K, L, and 
furthermore set M k = K k U L k , f or k = 1,2,3. Inspection of the incidences of cells 
shows that each of K k and L k is a 3-dimensional ball embedded in Arc, as illustrated 
in Figures 8a and 8b, respectively, with Kij on the top in part a) and Lji on the top 
in part b). Gluing together L k and K k along their common faces Gij , Gji, we discover 
that M k is almost a 3-dimensional ball embedded in Arc except that two points in its 
boundary are identified to the single point A k in Arc, as illustrated in Figure 8c. 




Figure 8a K k . Figure 8b L k . Figure 8c M k . 

Figure 8 M k and the balls K k , L k . 



Each M k , for k = 1, 2, 3, has its boundary entirely contained in the sub complex J of 
Arc spanned by 

{A k : k = 1,2, 3} U {Cij, Jij : i,j G {1, 2, 3} are distinct}. 

In order to understand J, we again symmetrize and define J k = Jij U Jji, so each J k 
is isomorphic to a cone, as illustrated in Figure 9; we shall refer to the 1-dimensional 
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simplices Cik,Cjk as the "generators" and to Cij,Cji as the "lips" of Jk- The one- 
skeleton of J plus the cone is illustrated in Figure 9. Imagine taking k = 3 in Figure 
9 and adjoining the cone J<i so that the generator C±2 of J2 is attached to the lip C\i 
of J3, and the generator C13 of J3 is attached to the lip C13 of J2. Finally, J itself is 
produced by symmetrically attaching J\ to J2 U J3 in this lip-to-generator fashion. 




Figure 9 The cone Jk and the one-skeleton of J. 

In order to finally recognize the 3-dimensional sphere, it is best to take a regular neigh- 
borhood of J in Arc, whose complement is a disjoint union of three 3-dimensional balls. 
Each 3-dimensional ball is naturally identified with the standard "truncated" 3-simplex, 
where a polyhedral neighborhood of the 1-skeleton of the standard 3-simplex has been 
excised. These truncated simplices are identified pairwise along pairs of hexagonal faces 
to produce the 3-dimensional sphere in the natural way. 

We next prove that the arc complex Arc(F) minus a point is contractible, for any surface 
F = Fq Let a denote an arc labeled by {s}, and let b denote an arc labeled by S — {s}. 
We shall show that A = Arc(F) — {[a]} strong deformation retracts onto {[b]}. The 
retraction proceeds in four steps, as follows: A retracts to 

B = {[a] E A : no vertex of r(a) is labeled by {s}}, 

B retracts to 

C = { [a] E B : s is not a member of the label of any vertex of r(a)}, 
C retracts to 

V = {[a] E C : 3! vertex of r(a) adjacent to the root labeled by S — {s}}, 
and finally V retracts to the point [b] . 

In order to coherently describe these retractions, let us introduce a natural linear ordering, 
the so-called "pre-ordering" , on the vertices of any tableaux r. Begin with the root of r 



23 



as the first element in the putative pre-ordering. Given a vertex v at distance n > from 
the root, there is a (possibly empty) family vi, v%, . . . , v m of vertices of r at distance n+1 
from the root which are adjacent to v, and this family of vertices comes in the right-to-left 
linear ordering from the definition of a tableaux. Furthermore, each vj is the root of a 
(possibly empty) sub tableaux Tj of r, for j = 1, 2, . . . , m, whose other vertices (if any) are 
at distance at least n + 2 from the root in r. Take the pre-ordering on n, then on T2, and 
so on up to T m , to complete the recursive definition of the pre-ordering on the vertices of 
t itself. 

For the first retraction from A to £>, simply decrease the projective weights, one at a time, 
on the one-simplices of r(a), for [a] G A — £>, which terminate at vertices labeled by {s} 
beginning with the greatest such vertex in the pre-ordering. 

For the second retraction from B to C, consider the greatest vertex v of r(a), for [a] G B—C, 
in the pre-ordering whose label S contains s, and let e be the edge of r(a) terminating 
at v, where e has a projective weight w. Insert a new bi-valent vertex u into e labeled 
by S and re-label v by S — {s}. The edge e is thus decomposed into two edges, and the 
one incident on v is given projective weight tw, while the other is given projective weight 
(1 — t)w, for < t < 1. The corresponding deformation as t goes from to 1 effectively 
replaces the label S on v with the label S — {s} and thus reduces the number of vertices 
whose label contains s. Iteratively removing s from the label of the greatest vertex in this 
manner defines the retraction of B to C. 

For the third retraction, suppose that [a] lies in C — V, and let v denote the root of r(a). 
Adjoin to r(a) a new edge e incident on v, and let the other endpoint of e be the new 
root of the resulting tableaux, where v is labeled by S — {s}. Let w denote the projective 
class of the sum of all of the weights on r(a), and define a projective weight tw on e, for 
< t < 1. The corresponding deformation as t goes from to 1 effectively adds a new 
vertex with label S — {s}. 

Finally, simply decrease the projective weights on all of the edges of r(a) except the edge 
incident on the root in order to contract T> to [b] , completing the proof that the complement 
of a point in the arc complex Atc{Fq ^pis contractible. 

We shall say that a surface F "satisfies the inductive hypothesis" provided Arc(F a ) is 
PL-homeomorphic to a sphere of dimension N(F a ) for each non-empty arc family a in F. 
(For the perspicacious reader, notice that we tacitly assumed this inductive hypothesis in 
the proof of Lemma 13.) The inductive hypothesis thus implies that Arc(F) is a manifold 
by Proposition 14, and this manifold is simply connected and a homology sphere since 
the complement of a point is contractible by the previous discussion. By the Poincare 
Conjecture in high dimensions, it follows inductively that Arc(F) is a sphere for F = F^ 
and hence for F = F^ ^ for any n > 1 by Lemma 13. 

This completes the promised proof of the sphericity theorem for punctured polygons. 
6. Punctured surfaces and fatgraphs 
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We take this opportunity to point out that in [P3] we claimed that our compactification 
maps continuously to the Deligne-Mumford compactification, and this may not be true; the 
precise connection between the two compactifications will be taken up elsewhere. Secondly, 
Theorem 5 of [P3] is slightly corrected by Theorem 17 below. 

We return to the setting F = Fg of punctured surfaces with s > 1 and r = 0. Let us 
choose from among the punctures of F a distinguished one, to be denoted n. Build as 
before the geometric realization A'^{F) of the partially ordered set of all isotopy classes of 
arc families of simple essential arcs which are asymptotic to 7r in both directions (referred to 
subsequently cis circs "based" at n). Consider the punctured arc complex of F = F* given 
by A 7r (F) = A'^(F) I P 'M C '(F) . Notice that given an arc family a in F, each component 
of F a inherits a labeling as a bordered surface, where the ends of F a are taken to be the 
distinguished points in the boundary; this is the basic connection between bordered and 
punctured arc complexes. 

Several of the main results of [P3] are summarized in the following theorem. 

Theorem 15 For any choice of puncture tt, the Teichmiiller space Tg of F = Fg is 
naturally isomorphic to the subspace of A^ = A'^F) corresponding to arc families based 
at 7T which quasi-fill F, so A^ is a compactification of M = M s g . The third barycentric 
subdivision of A' n descends to an honest simplicial complex on the compactification A n of 
M. 

Degenerate structures (i.e., "ideal" points of A n — M) will be fully described presently 
as will a graphical description of cells together with the corresponding matrix-model. For 
the moment, though, we turn to sphericity. In fact, orbifoldicity of A n follows from the 
sphericity conjecture. 

Indeed, as in the proof of Proposition 14, where the link of a simplex a[a] in the second 
barycentric subdivision of Arc(F) is isomorphic to Arc(F a ), this time there is a finite group 
Q a acting on a[a] * Arc'(F a ), and the natural mapping Int(a[a] * Arc'(Fa)) — > A'(F) 
induces a homeomorphism between its image and the quotient Int(a[a] * Arc' (Fa)) /Q a . 
Thus, the links of simplices are virtually arc complexes of bordered surfaces (i.e., spheres), 
so A-x is indeed an orbifold. The local groups Q a in the orbifold structure of A n are easily 
described and recognized as follows. Consider the stabilizer S a < PMC(F) of F a in F 
(which may permute the labeling of F a as a bordered surface) , and notice that the quotient 

G a = S a /PMC(F) 

is a finite group. For instance, a component of F a might be an annulus of type F Q ^ n, 
and there may be some G PMC(F) which cyclically permutes the n distinguished points 
on a common boundary component; the resulting coset corresponds to a "fractional Dehn 
twist" . 

The starting point for the sphericity conjecture was the request / challenge from Ed Witten 
in 1989 to find an orbifold compactification of moduli space together with a cell decompo- 
sition which restricts to the usual one on moduli space itself. The sphericity theorem for 
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multiply punctured disks (proved in §5) already proves orbifoldicity for punctured spheres 
as noted in [P3] . The sphericity conjecture then gives orbifoldicity of our new compactifi- 
cation for any punctured surface. 

Given a quasi triangulation r of F = based at n, we may formally add in a choice- free 
manner an edge to r in each punctured component of F T to produce an ideal triangulation 
t + of F. There is a fatgraph G in F which is Poincare dual to r + in F. On this level of 
dual fatgraphs, there is one loop of G for each puncture of F other than 7r; these loops 
are taken formally here in the sense that we associate a lambda length to each such loop, 
where these lambda lengths are taken to be large relative to the non-loop lambda lengths 
and are otherwise unrestricted. 

The promised description of degenerate structures is given by the next theorem. 

Theorem 16 [P3; Theorem 7] Fix a quasi triangulation r of Fg with associated ideal 
triangulation t + . Let Ej(t) be a degeneration of simplicial coordinates on r + , and let 
(5 denote the collection of arcs in t + so that the corresponding lambda lengths do not 
diverge. Fixing a horocycle about tv, each component of Fp comes equipped with a canonical 
decoration at the cusps on the boundary. The polygons and once-punctured polygons of Fp 
combine to give a decorated hyperbolic structure on a bordered subsurface GF , and the 
complement TF = F — GF is regarded as a topological bordered surface (with no hyperbolic 
structure) together with a decoration coming from the specified horocycle. 

Thus, a point in gives rise to a geometric part GF C F together with a topological 
part TF C F, and the topological part is non-empty if and only if the point is ideal. One 
is thus permitted to "forget" the hyperbolic structure on a subsurface TF. In particular, 
the Deligne-Mumford compactification corresponds to "forgetting" the structure only on 
disjoint collections of annuli TF C F. 

An elementary observation is that any graph G admits a canonical decomposition into its 
recurrent part RG and its non-recurrent part NG, as follows. We shall say that a closed 
edge-path P = ei, e2, . . . , e n on G, thought of as a concatenation of edges, is "edge-simple" 
provided: 

i) Gi 7^ ej+i for any i (where the index % is taken modulo n + 1); 

ii) P never twice traverses the same oriented edge of G. 

The decomposition is given by: 

RG = {e E G : 3 edge — simple curve through e}, 
NG = G - RG. 

The next result improves and slightly corrects an earlier result. 

Theorem 17 [P3; Theorem 5] Consider a degeneration of hyperbolic structure, where 
simplicial coordinates Ei(t) — > for the indices i & I and lambda lengths ej(t) — > oo for 
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indices j G J. Then J C I. Furthermore, letting Gj = {e G G : e G 1} and likewise Gj, 
we have R(Gi) = Gj. 

Indeed, that J C I follows directly from the second part of Lemma 5, and the first part of 
Lemma 5 is applied to prove the remaining assertions about recurrent subgraphs. 

The combinatorics of our compactification is clear on the level of arc families based at n in 
a surface F = F*. Cells simply correspond to arc families which may or may not quasi fill, 
and the topological part of the decomposition of F, if any, arises from the complementary 
components which are different from (perhaps once-punctured) polygons. The graphical 
formulation is somewhat complicated but interesting nonetheless, and we finally describe 
it. Given the quasi triangulation r of F, complete it to an ideal triangulation r + as before 
with corresponding dual marked fatgraph G' = G' T . In contrast to the previous discussion, 
we here simply remove all the loops to produce a marked fatgraph G = G T , where G has 
s — 1 external nodes, and we label these external nodes with the indices 2, . . . , s of the 
corresponding punctures. 

One considers Whitehead moves (i.e., contraction/expansion of edges of the fatgraphs) as 
usual, and now also another elementary move supported on a surface of type (2)- This 
new move interchanges the two possible quasi triangulations of this surface, so on the level 
of dual fatgraphs, given a puncture i, the new move just flips the edge containing the 
terminal node labeled i by altering the fattening at its other vertex. In fact, this new 
elementary move is just the composition of two Whitehead moves on ideal triangulations 

<rf*0,(2)- 

We now two-color the edges of G by specifying "regular" and "ghost" edges, where we 
thing of ghost edges as "missing" , and there must be at least one regular edge. 

Take equivalence classes of these two-colored fatgraphs with labeled terminal nodes under 
Whitehead moves and the new elementary move, where we are only allowed to perform 
moves along the ghost edges. Furthermore, once an edge is a ghost edge, it remains a ghost 
("once you're dead, you're dead"), and otherwise the color of an edge is unaffected. For 
instance, in order to collapse a single edge with distinct vertices in a cubic fatgraph to a 
quartic vertex, instead, put a ghost icon on the edge to be collapsed; collapse and expand 
it to produce another cubic fatgraph with a ghost icon on the new edge, and identify these 
two ghostly fatgraphs. 

Theorem 18 [P3; Theorem 9] The dual of the simplicial complex A' n is naturally isomor- 
phic to the geometric realization of these marked two-colored labeled fatgraph equivalence 
classes. 

An obvious project is to compute the virtual Euler characteristics using this two-colored 
matrix model. It is also worth mentioning that the graphical formulation of Arc(F) for F 
a bordered surface can be described in analogy to this discussion, where each distinguished 
point in the boundary gives rise to a "hair" in the natural way. The dual of Arc'(F) is 
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naturally isomorphic to the set of two-colored fatgraphs with s external nodes and A hairs 
with external nodes, both labeled, modulo Whitehead moves on the ghost edges, where 
each boundary component has at least one hair and hairs are never allowed to be ghosts 
(which has a macabre corporeal interpretation). 

7. Operads 

Several topological and homological operads based on families of projectively weighted 
arcs in bordered surfaces F* are introduced and studied in [KLP]. This work as well as 
further material [Kl] due to Kaufmann are surveyed in his paper [K2] in this volume, so 
our treatment here will be brief and contextual. The spaces underlying the basic operad 
are identified with open subsets of Arc(F) which contain Arc#(F) for suitable families 
of bordered surfaces F. Specifically, say that an arc family a in F is exhaustive if for 
every boundary component of F, there is some arc in a which is incident on this boundary 
component, and define 

Arc s g (n) = {[a] G Arc(Fg n+1 ) : a is exhaustive}. 

We shall see that "gluing in the spirit of train tracks" imbues Arc = {Arc s g (n)} n >i 
with a cyclic operadic structure which restricts to a sub-operadic structure on Arc cp = 

{-Arcg(n)} n >i; there are also natural twistings Arc (n) = Arc s g (n) x (5' 1 ) n+1 with related 
operadic structures for which we refer the reader to [KLP]; this trivial Cartesian product 
is interpreted as a family of coordinatizations of the boundary of F which are twisted by 
Arc g (n). 

Already algebras over Arc cp < Arc are shown in [KLP] to be Batalin-Vilkovisky algebras 
where the entire BV structure is realized simplicially. An example of this simplicial re- 
alization is given by the symmetric diagram which describes the BV equation itself, as 
described in [K2]. Furthermore, this basic operad contains the Voronov cacti operad [Vo] 
up to homotopy. 

Our operad composition on Arc depends upon an explicit method of combining families of 
projectively weighted arcs in surfaces, that is, each component arc of the family is assigned 
the projective class of a positive real number; we next briefly describe this composition. 
Suppose that F 1 , F 2 are surfaces with distinguished boundary components d 1 C F 1 , d 2 C 
F 2 . Suppose further that each surface comes equipped with a properly embedded family 
of arcs, and let denote the arcs in F l which are incident on d l , for i = 1, 2, as 

illustrated in part I of Figure 10. Identify d 1 with d 2 to produce a surface F. We wish 
to furthermore combine the arc families in F X ,F 2 to produce a corresponding arc family 
in F, and there is evidently no well-defined way to achieve this without making further 
choices or imposing further conditions on the arc families (such as p 1 = p 2 ). 

The additional data required for gluing is given by an assignment of the projective class of 
one real number, a weight, to each arc in each of the arc families. The weight Wj on a* is 
interpreted geometrically as the height of a rectangular band = [0, 1] x [—Wj/2, Wj/2] 
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whose core [0, 1] x {0} is identified with a*, for i = 1,2 and j = 1, . . . ,p l . If we assume 

12 

that Y7j=i w j = Y7j=i w ] f° r simplicity, so that the total height of all the bands incident 
on d 1 agrees with that of d 2 ; in light of this assumption, the bands in F 1 can be sensibly 
attached along d 1 to the bands in F 2 along d 2 to produce a collection of bands in the 
surface F as illustrated in part II of Figure 10; notice that the horizontal edges of the 

1 2 

rectangles {R)}\ decompose the rectangles {Rj}\ into sub-rectangles and conversely. 
The resulting family of sub-bands, in turn, determines a weighted family of arcs in F, one 
arc for each sub-band with a weight given by the width of the sub-band; thus, the weighted 
arc family in F so produced depends upon the weights in a non-trivial but combinatorially 
explicit way. This describes the basis of our gluing operation on families of weighted arcs, 
which is derived from the theory of train tracks and partial measured foliations [PH]. In 
fact, the simplifying assumption that the total heights agree is obviated by considering not 
weighted families of arcs, but rather projectively weighted families of arcs since we may de- 
projectivize in order to arrange that the simplifying assumption is in force (since p 1 p 2 ^ 
by exhaustiveness) , perform the construction just described, and finally re-projectivize. 
The gluing operation just described may give rise to bands which are embedded annuli 
foliated by simple curves, and these are simply discarded to finally produce a resulting 
projectively weighted arc family which is again exhaustive. 




I II 



Figure 10 Gluing weighted arc families as train tracks 



Enumerate the boundary components of F l as <9q, d{, . . . , d l n . once and for all for % = 1, 2; 
the gluing operation applied to boundary components d\ and <9q induces the usual o i 
operation for i = 1, . . . , n\. Letting S p denote the p th symmetric group, there is a natural 
S n +i-action on the labeling of boundary components which restricts to a natural S n -action 
on the boundary components labeled {1, 2, . . . , n}. Thus, S n and S n+ i act on Arc s g {n), 
and extending by the diagonal action of S n+ i on (5' 1 ) n+1 , the symmetric groups S n and 
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S n+ i likewise act on Arc g {n), where S n by definition acts trivially on the first coordinate 
in (S 1 )^ 1 . 

Theorem 19 [KLP; Theorem 1.5.2-3] The compositions o 4 are well-defined and imbue 
Arc cp with the structure of a topological operad under the natural S n -action on labels. 
The operad has a unit and is cyclic for the natural S n +i-action. Arc likewise inherits the 
structure of a cyclic operad with unit. 

Just to give the flavor, we illustrate as one-parameter families of weighted arc families in 
Figure 11 the various operations (the identity, BV operator, dot product, and star product), 
where the parameter s satisfies < s < 1 and the outermost boundary component is labeled 
zero by convention. In this same spirit, the BV equation itself is given by an explicit and 
symmetric chain homotopy of weighted arc families as explained in [KLP] or [K2]. 



Figure 11 The identity, BV operator, dot product, and star operator 

Just as for cacti, Arc cp acts on the loop space of any manifold; does this action extend to 
an action of Arc or Arc itself? Does our formalism allow for a coherent description and 
calculation of the full string prop or its compactification? Does the BV structure of Arc 
or Arc cp survive injectively on the level of homology? 

As to the calculation of the homology of Arc cp , the planar sphericity conjecture is of utility 
since the complement of Arc cp (n) in Atc(Fq +1 ) is a finite union of spheres Atc(Fq ), 
where r+s = n+1. This arrangement of spheres should be analyzed using a Mayer- Vietor is 
argument, and the homology of Arc cp itself could then be computed using Alexander 

duality in the sphere Arc(FQ n+1 ); the twistings Arc cp could also then be computed using 
the Kiinneth formula. Of course, the full sphericity theorem is likewise useful in calculating 
the homology operad of Arc itself, of twistings such as Arc, and presumably also of the 
string prop. 



Appendix. Biological Applications 

The prediction of "macromolecular folding" is a central problem in contemporary biology. 
The "primary structure" (i.e., the sequence of nucleotides C,G,A,U for RNA, or of 20 
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amino acids for a general protein) can be determined experimentally, and one wishes to 
predict from it the full three-dimensional structure or "folding" of the macromolecule. We 
have glossed over interesting and important mathematics here, the problem of "sequence 
alignment", one aspect of which is to re-construct the entire primary structure of the 
macromolecule from empirically known snippets. 

The basic model of the "denatured macromolecule" is I = I m = [0, m] C Z C R for 
some m. A general "bonding" on I m is simply a collection B of unordered pairs 
where i,j G Z, \i — j\ > 1, and < i,j < m. Various pairs of "sites" i,j G I are 
"bonded" if there is a corresponding element {i,j} G £>, and we may consider the semi- 
circle Cij CRx R>o Q R 2 with endpoints (i, 0), (j, 0). One thinks of R as the "backbone" 
of the macromolecule, a tensile rod which bends to allow the bonding, where each Cij 
is collapsed to a point; in particular, the backbone is sufficiently rigid to prohibit any 
bond with \i — j\ = 1. We say that a bonding B is a "secondary structure" if 

fl Cki Q R for all {i,j},{k,£} G £>, whence {C^ : G £>} is an arc family in 

R x R>o ^ Z x R>o ^ Im x R->o- The secondary structure is said to be "binary" 
provided Cij fl Cm ^ implies {i,j} = {k,£}. A "helix" of length I is a collection 
{i + l,j — 1}, . . .{i + £,i — £} G B, whence it is a sequence of consecutive bonds. 
There is a natural binary secondary structure associated with an arbitrary one as illustrated 
in Figure 12. 




Figure 12 Binary from arbitrary foldings 



An "RNA secondary structure" is a binary secondary structure on I m for some m. This 
evidently depends upon a plane of projection, but in practice, there seems to be a more 
or less well-defined plane of projection for the arc family. (This is the plane the biologists 
draw for RNA which contains most of the bonding.) In any case, the definition of an RNA 
secondary structure is mathematically complete. In practice in R 3 , there is further bond- 
ing between sites which does not lie in the plane of projection, and this is the "tertiary 
structure" or full three-dimensional folding, of which we shall give mathematical formula- 
tions later. The prediction of tertiary from primary structure for RNA is the full folding 
problem. The prediction of the biologically correct secondary structure from the primary 
structure for RNA is another example of the macromolecular folding problem, which we 
shall discuss further below. 

For a general protein, the "secondary structure" is a decomposition of it into two standard 
motifs: the a-helix and the /3-sheet. The a-helix is an (essentially always right-handed) 
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helix, and the /3-sheet is a motif where the macromolecule repeatedly folds back upon itself 
in a plane. (There are other basic motifs for general proteins which can be added to the 
definition of its secondary structure, but we shall not take this up here.) Again in any 
case, the full three-dimensional folding of a general protein is its "tertiary stucture" , and 
the folding problem is to predict secondary or tertiary from primary structure. 

In the context of general proteins, the most effective methods of folding prediction are 
based upon motif searches and matches with a database of primary structures whose 
foldings have been experimentally determined by X-ray crystallography or other methods 
(and these experiments are delicate and relatively few). This is obviously disappointing 
to a theorist seeking ab initio methods based on physical or mathematical principles. Any 
serious such model should evidently incorporate what seems to be the main driving force 
of "hydophobicity/hydrophillicity" (the tendency of individual nucleotides, amino acids or 
sequences of them to shun/seek water in the biological environment), but experts seem 
to think that an effective thermodynamic model based on hydrophobicity is just pie-in- 
the-sky. Furthermore, structural considerations determined by the stoichiometry of the 
primary structure also seem to play a role. Effective prediction for biologically interesting 
primary sequences of length 10 2 — 10 9 may be too complex for any ab initio methods. 

For RNA secondary structures, however, there are rigorous and reasonably reliable ther- 
modynamic models, the "standard" one being Mfold [ZMT,ZS] which is defined as follows. 
The input is the primary structure (that is, a long sequence of letters C,G,A,U); the 
output is not only the optimal (i.e., global energy minimizing) folding or foldings but also 
the "sub-optimal" band of the local minima whose values are within, say, five percent of 
the global minimum. Experience seems to indicate that the correct foldings at least lie in 
such a sub-optimal band. The thermodynamics is governed by "nearest neighbor bonds" , 
that is, Boltzmann weights are assigned to bonds and to pairs and triples of bonds in a 
helix at adjacent sites; the energies of these configurations at fixed temperature have been 
determined experimentally, and the thermodynamic model can be solved (for instance 
stochastically) on the computer. This model apparently very accurately predicts melting 
points of RNA in most known examples. 

Bondings are naturally partially ordered by inclusion, and the restriction to secondary 
structures on a fixed number of sites thus has a geometric realization given by an arc 
complex for a polygon as was observed in [PW]. By the Classical Fact 11, this space of 
secondary structures is a sphere. Statically, this gives combinatorial information about the 
space of secondary structures, and dynamically, one imagines the evolution of a secondary 
structure in time as a dynamical system on a suitable sphere. Also proved in [PW] is 
the fact that an appropriate space of all RNA secondary structures is likewise homotopy 
equivalent to an arc complex (for instance, the space of all RNA secondary structures with 
no non-trivial helices and a fixed number of unbonded sites is again homeomorphic to the 
arc complex of a polygon), and a novel constructive proof (based on train tracks) of the 
Classical Fact 11 is presented. 

In this context, the planar sphericity conjecture has the immediate interpretation as cap- 
turing the combinatorics of several interacting secondary structures on macromolecules 
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in both the general and binary contexts. Any model of full three-dimensional folding of 
systems of macromolecules should be explicable in terms of arc families, and towards this 
end, we next describe several recent models of "pseudo-knotting" of secondary structures. 
Among the many elaborations and extensions of the basic thermo dynamical model Mfold 
described before, [IS] and especially [XBTI] emphasize a related notion of pseudo-knotting 
among other innovations. 



The first model we propose is the three-dimensional folding of a single binary macro- 
molecule. 

Binary Chiral Model F Given a binary bonding B on 7 m , construct a bordered 
surface F' C R 3 as follows. Attach to I m x [0, 1] C R 2 C R 3 a collection of bands 
in the natural way, one band flk for each bond {ii,jk} £ £> 5 where the core of the 
band (3k agrees with Ci k j k ; the ^-coordinates are adjusted in any convenient way to 
arrange that the bands are all disjointly embedded. Consider a boundary component 
K of F'; as illustrated in Figure 13, there are two cases for K: either the projection 
of K bounds a polygon, or it does not, and we adjoin to F' an abstract polygon 
along each component of the former type to produce a surface F. The boundary 
components of F are called "pseudo-knots". Notice that altering the lengths of 
helices or breaking helices by adding free sites leaves invariant the topological type 
of F, and F is planar if and only if the folding is a secondary structure. 




Figure 13 Boundary components of F'. 

The "achiral model" for binary folding allows each endpoint of a band to be attached 
either above the x-axis or below the x-axis (so there are four ways to attach a band, 
and more data is required for each band) . In the same manner as before, we may build 
a surface F' and cap off certain of its boundary components to produce another surface 
F, whose boundary components are the pseudo-knots. The chiral and achiral models for 
arbitrary macromolecules are defined similarly. It is worth saying explicitly that there 
are further elaborations of these four basic models (chiral/achiral for binary /arbitrary) 
tailored to other natural biophysical circumstances. 

We had already studied in [P2] a thermodynamic model with Boltzmann weights given 
by simplicial coordinates; this physics project was undertaken as a pure mathematician 
since the structure and attendant calculations were pleasing. The achiral model in the 
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binary case is closely related to the thermodynamic model described in [P2]. (The two 
cyclic orderings on a set with three elements in [P2] corresponds to the bond being 
attached above or below the x-axis.) 

There is no doubt that the combinatorics of suitable models of macromolecules are 
described by arc complexes. Whether protein folding will be effectively predicted by 
ab initio or by refined empirical motif recognition, a better understanding of this com- 
binatorics should be biologically useful. 

Speculative Remark As we have seen in §2, the simplicial coordinates are effectively 
volumes in Minkowski space, so in the spirit of [Kh] and [AS] , the thermodynamic model 
in [P2] has "the Boltzmann weights as volumes". Might the convex hull construction 
itself give a method of predicting RNA or protein folding? The key question is how to 
impose the primary structure of the macromolecule on the geometry. Perhaps of utility 
in this regard, [KV] describes a hyperbolic version of conformal field theory, which is 
especially interesting to us because the authors argue that it models suitable polymers; 
furthermore (without realizing it!), the authors employ the special and unnatural case 
of decorated Teichmiiller theory in which all of the horocycles have a common Euclidean 
radius e in a model of upper half space, and then take the limit e — > 0. Essentially 
the entire analysis of [KV] applies in our setting and suggests an explicit model of 
"hyperbolic field theory" , where the two-point function is the lambda length. 
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